CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments

نویسندگان

  • Masato Nakayama
  • Takanobu Nishiura
  • Yuki Denda
  • Norihide Kitaoka
  • Kazumasa Yamamoto
  • Takeshi Yamada
  • Satoru Tsuge
  • Chiyomi Miyajima
  • Masakiyo Fujimoto
  • Tetsuya Takiguchi
  • Satoshi Tamura
  • Tetsuji Ogawa
  • Shigeki Matsuda
  • Shingo Kuroiwa
  • Kazuya Takeda
  • Satoshi Nakamura
چکیده

In this paper, we newly introduce a collection of databases and evaluation tools called CENSREC-4, which is an evaluation framework for distant-talking speech under hands-free conditions. Distant-talking speech recognition is crucial for a handsfree speech interface. Therefore, we measured room impulse responses to investigate reverberant speech recognition in various environments. The data contained in CENSREC-4 are connected digit utterances, as in CENSREC-1. Two subsets are included in the data: basic data sets and extra data sets. The basic data sets are used for the evaluation environment for the room impulse response-convolved speech data. The extra data sets consist of simulated and recorded data. An evaluation framework is only provided for the basic data sets as evaluation tools. The results of evaluation experiments proved that CENSREC-4 is an effective database for evaluating the new dereverberation method because the traditional dereverberation process had difficulty sufficiently improving the recognition performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -

Recently, speech recognition performance has been drastically improved by statistical methods and huge speech databases. Now performance improvement under such realistic environments as noisy conditions is being focused on. Since October 2001, we from the working group of the Information Processing Society in Japan have been working on evaluation methodologies and frameworks for Japanese noisy ...

متن کامل

Performance estimation of speech recognition based on acoustic parameters under reverberation environments with CENSREC-4

The Corpus and Environment for Noisy Speech RECognition 4 (CENSREC-4) evaluation framework has been distributed for evaluating distant-talking speech under various reverberation environments. The CENSREC-4 includes both real and simulated reverberant speech with convoluting impulse responses in the same environment. In addition, it consists of many room impulse responses to simulate various env...

متن کامل

Improve Speech Recognition Performance in Reverberant Environment Based on Estimation of Energy Feature

The purpose of this paper is to improve speech recognition performance in reverberant environment with distant talking, which based on directly speech processing or selected Log-energy feature. Speech recognition performance can be improved by changing the value of Log-energy feature or directly speech energy. Experiments used CENSREC-4 corpus to evaluate distant-talking speech under various re...

متن کامل

CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition

This paper introduces a common database and an evaluation framework for connected digit speech recognition in real driving car environments, CENSREC-2, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. Speech data of CENSREC-2 was collected using two microphones, a close-talking microphone and a hands-free microphone, under three car speeds and four car conditions...

متن کامل

CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments

This paper introduces a common database, an evaluation framework, and its baseline recognition results for in-car speech recognition, CENSREC-3, as an outcome of the IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. CENSREC-3, which is a sequel to AURORA-2J, has been designed as the evaluation framework of isolated word recognition in real car-driving environments. Speech data wer...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008